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Abstract 

Energy harvesting (EH) nodes can play an important role in cooperative communication systems 
which do not have a continuous power supply. In this paper, we consider the optimization of conventional 
and buffer-aided link adaptive EH relaying systems, where an EH source communicates with the desti- 
nation via an EH decode-and-forward relay. In conventional relaying, source and relay transmit signals 
in consecutive time slots whereas in buffer-aided link adaptive relaying, the state of the source-relay and 
relay-destination channels determines whether the source or the relay is selected for transmission. Our 
objective is to maximize the system throughput over a finite number of transmission time slots for both 
relaying protocols. In case of conventional relaying, we propose an offline and several online joint source 
and relay transmit power allocation schemes. For offline power allocation, we formulate an optimization 
problem which can be solved optimally. For the online case, we propose a dynamic programming (DP) 
approach to compute the optimal online transmit power. To alleviate the complexity inherent to DP, 
we also propose several suboptimal online power allocation schemes. For buffer-aided link adaptive 
relaying, we show that the joint offline optimization of the source and relay transmit powers along with 
the link selection results in a mixed integer non-linear program which we solve optimally using the 
spatial branch-and-bound method. We also propose an efficient online power allocation scheme and 
a naive online power allocation scheme for buffer-aided link adaptive relaying. Our results show that 
link adaptive relaying provides performance improvement over conventional relaying at the expense of 
a higher computational complexity. 

This paper has been presented in part at the IEEE Vehicular Technology Conference (VTC), Spring, Yokohama, Japan, May 
2012. 
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I. Introduction 

In cooperative communication systems, a source and a number of cooperating relays expend 
their energy for processing and transmitting data. For some applications, connecting the source 
and the relays to the power grid is cumbersome or may even not be possible. Pre-charged batteries 
can be a viable solution to overcome this problem. In practice, the limited storage capacity of 
batteries and high transmit powers may result in quick drainage of the batteries. As a result, 
the batteries need to be replaced/recharged periodically which can be sometimes impractical. 
An alternative solution is the deployment of energy harvesting (EH) nodes. EH nodes harvest 
energy from their surrounding environment to carry out their functions. Energy can be harvested 
using solar, thermoelectric, and motion effects, or through other physical phenomena [Ij. An 
EH node that has used up its stored energy can harvest new energy and become again available 
for transmission. Thus, EH nodes can be regarded as a promising option for deployment as 
they ensure a long system lifetime without the need for periodic battery replacements. In EH 
cooperative systems, the energy can be independently harvested by the EH source and/or EH 
relays during the course of data transmission at random times and in random amounts. For data 
transmission (and for other signal processing tasks), EH nodes expend the energy from their 
storage and only the unused energy remains in the batteries. In particular, at each time slot, the 
source and the relays are constrained to use at most the energy available in their storage. These 
constraints necessitate the design of new transmission strategies for the source and the relays to 
ensure optimum performance in an EH environment. 

Recently, transmission strategies for and performance analyses of EH nodes in wireless com- 
munication systems have been provided in GD-0. In J2|, a single source-destination non- 
cooperative link with an EH source was considered and an optimal offline along with an optimal 
and several sub-optimal online transmission policies were provided for allocating transmit power 
to the source according to the random variations of the channel and the energy storage conditions. 
In 0, a similar system model was considered, where dynamic programming (DP) was employed 
to allocate the source transmit power for the case when causal channel state information (CSI) was 
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available. Several higher layer issues such as transmission time minimization and transmission 
packet scheduling in EH systems were considered in JSJ — [[V]| . The deployment of EH sensors in 
sensor networks has been extensively discussed in the literature HI, [H. The use of EH relays in 
cooperative communication was introduced in 01, where a comprehensive performance analysis 
was performed for relay selection in a cooperative network employing EH relays. A deterministic 
EH model (assuming a priori knowledge of the energy arrival times and the amount of harvested 
energy) for the Gaussian relay channel was considered in ifTOl . where delay and non- 
delay constrained traffic were studied. The concept of energy transfer in EH relay systems was 
considered in IfTTTl . where an offline power allocation scheme was proposed. 

In this paper, we consider a simple single link cooperative system where the source com- 
municates with the destination via a decode-and-forward (DF) relay. We assume that the relay 
operates in half-duplex mode. In most of the existing literature on half duplex relaying 01, 
flU, [fT2ll . |fT3l , it is assumed that relays receive a packet in one time slot from the source and 
forward it in the next time slot to the destination. We refer to this approach as "conventional" 
relaying throughout the paper. Recently, it has been shown in ifLUl . lfT5l that equipping relays 
with buffers can improve the performance of cooperative communication systems. In fact, using 
buffers at the relays allows storage of packets temporarily at the relay if the relay-destination 
channel condition is not good enough until the quality of the channel has sufficiently improved. 
In |fT3Tl , a buffer-aided adaptive link selection protocol was proposed. This protocol gives relays 
the freedom to decide in which time slot to receive and in which time slot to transmit. 

In this paper, we assume that the source and the relay are EH nodes and consider both 
conventional and buffer-aided link adaptive relaying. For both protocols, we propose offline and 
online (real-time) power allocation schemes that maximize the end-to-end system throughput 
over a finite number of transmission slots. The offline schemes are of interest when the amount 
of harvested energy and the channel signal-to-noise ratio (SNR) for all transmission slots are 
known a priori. However, in practice, the amount of harvested energy and the channel SNR 
are random in nature and cannot be predicted in advance. Therefore, in this case, online power 
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allocation schemes have to be employed taking into account the available knowledge of channel 
SNR and harvested energy. Nevertheless, considering offline schemes is still important since 
they provide performance upper bounds for the practical online schemes. For conventional 
relaying, we propose an optimal online power allocation scheme which is based on a stochastic 
DP approach. To avoid the high complexity inherent to DP, we also propose several sub- 
optimal online algorithms. In case of buffer-aided link adaptive relaying, we formulate an offline 
optimization problem that jointly optimizes the source and the relay transmit powers along with 
the link selection variable. Thereby, the link selection variable indicates whether the source or the 
relay is selected for transmission in a given time slot. The optimization problem is shown to be a 
non-convex mixed integer non-linear program (MINLP). We propose to use the spatial branch- 
and-bound (sBB) method to solve the offline MINLP problem optimally Oill-fflBl- We also 
propose a practical online power allocation scheme for the buffer-aided link adaptive relaying 
protocol. We note that our buffer-aided link adaptive protocol is significantly different from the 
delay constrained model in @. The model in assumes the presence of a buffer at the relay 
but does not consider adaptive link selection. 

The remainder of this paper is organized as follows. In Section [TTl the system model for the EH 
system is presented. Different power allocation schemes for conventional and buffer-aided link 
adaptive relaying are provided in Sections [TIT] and [IVj respectively. In Section |Vj the effectiveness 
of these power allocation schemes is evaluated based on simulations. Section [VTJ concludes this 
paper. 

II. System Model 

We consider an EH relay system, where the source, S, communicates with the destination, D, 
via a cooperative relay, R, as shown in Fig. [H Both S and R are EH devices and their participation 
in signal transmission and processing depends on the harvested energy. The harvested energy 
can be of any form, e.g. solar, wind, or electro-mechanical energy. S and R are equipped 
with batteries, which have limited storage capacity and store the harvested energy for future 
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Battery Battery 



Fig. 1. System model for a single link S-R-D system where S and R are EH devices. S and R have batteries with finite 
storage which can store the energies harvested from the energy sources. The rectangle boxes in S and R represent the batteries 
and the hashed areas represent the amount of energy stored. 

use. In particular, the batteries of S and R can store at most Bs, m ax and Bn tmax units of 
energy, respectively. We assume the transmission is organized in time slots of duration T. In the 
following, without loss of generality, we set T = 1, and the system transmits for K time slots. 
In the next two subsections, we discuss the signal model, the system throughput, and the battery 
dynamics for conventional and buffer-aided link adaptive relaying. 

A. Conventional Relaying 

Signal Model: In conventional relaying, during the first time slot, S transmits and R receives, 
and during the second time slot, R transmits and D receives. This sequential process continues 
for K time slots. Here, K is assumed to be an even number. The received packet at R in the 
(2k — l)th time slot, where k G {1, 2, • • • , K/2}, is modelled as 

UR,2k-i = h s ,2k-\X2k-i + nn t 2k-i, (1) 

where hs^k-i is the fading gain of the S-R link, and n^ik-i denotes the noise sample at R. 
The transmitted packet a?2fc-i contains Gaussian-distributed symbols. Assuming DF relaying, the 
detected packet, x 2 k, is transmitted from R during time slot 2k. Thus, the received packet at D 
is given by 

UD,2k = hpt t 2kX2k + n D,2k, (2) 
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where h Rj2k and no^k denote the fading gain of the R-D link and the noise sample at D, respec- 
tively. h St 2k-i and h Rj2k can follow any fading distribution, e.g. Rayleigh, Rician, Nakagami-g, 
or Nakagami-m fading. n R:2k -i and n Dy2k are additive white Gaussian noise (AWGN) samples 
having zero mean and unit variance. We assume the channels are quasi-static within each slot and 
the channel SNRs of the S-R and the R-D links are 75,2^-1 and respectively. In particular, 
7s,2fc-i = \hs,2k-i\ 2 and 7^ = \h R)2k \ 2 - We assume 7^-1 and <y R>2k to be independent and 
identically distributed (i.i.d.) over the time slots. Furthermore, 7s,2fc-i and 7^,2/0 are mutually 
independent but not necessarily identically distributed (i.n.d.). For future reference, we introduce 
the average SNRs of the S-R and the R-D links as 75 and j R , respectively. 
System Throughput: When x 2 k-i is transmitted from S with transmit power Ps^k-i during 
time slot 2k — 1, 

fs,2fc-l = l0g 2 (1 + lS,2k-lPs,2k-l) (3) 

bits of data can be transmitted error-free via the S-R link. Similarly, when x 2k is transmitted 
from R with transmit power P Ry2 k, 

$R,2k = log 2 (1 + lR,2kPR,2k) (4) 

bits of data can be transmitted via the R-D link. It is worth mentioning that during the 2A;th 
and the (2k — l)th time slots, S and R, respectively, do not transmit any data, i.e., Ps i2 k = 
and P R:2 k-i = 0. We assume that R ensures error-free detection by employing an appropriate 
error correction coding scheme and hence x 2k = x 2 k-i- Therefore, the end-to-end (S-D) system 
throughput is given by | min{^5 j2 fc-i, £,R,2k} bits/s/Hz where the factor | is due to the half-duplex 
constraint. 

Battery Dynamics: The energies stored in the batteries of S and R in time slot k are denoted by 
P>s,k and B Rjk , respectively. The transmit powers of S and R axe limited by the battery energies, 
i.e., < Ps,2k-i < B S2k -\ and < P R ^ 2 k < B R 2k . We assume throughout this paper that the 
energy consumed by the internal circuitry of S and R is negligible compared to the transmit power 
0. The energy harvester at S collects Hs, m < B s ,max units of energy during the mth time slot, 
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where m £ {1, 2, • ■ • , K}. Similarly, the energy harvester at R collects H R ^ m < B R ^ max units of 
energy during the mth time slot. It is worth noting that energies are harvested during every time 
slot m at S and R regardless of Ps, m and P RjTri . Let H s ,e — E{H s , m } and H R:E = E{H R>m } 
denote the average energy harvesting rate of S and R over the time slots, respectively. Here, 
E{-} denotes statistical expectation. Because of the spatial separation of S and R, we assume 
Hs, m and H R m are independent of each other and i.i.d. over the time slots. Similar to O, we 
assume the stored energies at S and R increase and decrease linearly provided the maximum 
storage capacities, Bs, ma x and B R ^ max , are not exceeded, i.e., 

Bs,m+1 = m in{(-B5 >rn — Ps ym + Hg >m ), Bs } max}, Vm (5) 

B R:m+ i = min{(B R m — P R)Tn + H R ^ m ), B R ^ max } } Vm. (6) 

Furthermore, B S i = H$ t o > and B R i = H R > 0, respectively, denote the available energies 
at S and R before transmission starts. 

B. Buffer-Aided Adaptive Link Selection 

Signal Model: For buffer-aided link adaptive relaying, the relay R is equipped with a buffer in 
which it can temporarily store the packets received from S. In this approach, one of the nodes 
decides whether S or R should transmit in a given time slot, k £ {1, 2, • • • , K}, based on the 
CSI of the S-R and the R-D links |[T5l . Therefore, unlike conventional relaying, in any time 
slot k, S or R can transmit packets. Let d/- E {0, 1} denote a binary link selection variable, 
where <4 = (<4 = 1) if the S-R (R-D) link is selected for transmission. When c4 = 0, the 
received packet at R is given by 

VR,k = h s ,kXk + n R}k . (7) 
On the other hand, when c4 = 1, the received packet at D is given by 

VD,k = h R)k x k + n D)k . (8) 
System Throughput: When d k — 0, S is selected for transmission and 

= (1 - 4)log 2 (1 + is, k Ps,k) (9) 
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bits of data can be transmitted error-free via the S-R link. Hence, R receives £s,k data bits 
from S and appends them to the queue in its buffer. Therefore, the number of bits in the buffer 
at R at the end of the A;th time slot is denoted as Qk and given by Qk = Qk-i + £s,k- However, 
when dk = 1, R transmits and the number of bits transmitted via the R—D link is given by 

£ Rtk = min{41og 2 (1 + j R , k PR,k) , Qk-i}- (10) 

It is worth noting that the maximal number of bits that can be sent by R is limited by the number 
of bits in the buffer and the instantaneous capacity of the R—D link 031 • The number of bits 
remaining in the buffer at the end of the kth time slot is given by Q k = Q k -i — £,n,k- We assume 
that S has always data to transmit and the buffer at R has very large (possibly infinite) capacity 

K 

to store them. Therefore, a total of ^ ^ k bits are transmitted from S to D during the entire 

fc=i 

transmission time. 

Battery Dynamics: The battery dynamics for the link adaptive transmission protocol are identical 
to those for conventional relaying. 

III. Power Allocation for Conventional Relaying 

In this section, we propose an offline and several online power allocation schemes for the 
considered EH system with conventional relaying. 

A. Optimal Offline Power Allocation 

Our objective is to maximize the total number of transmitted bits (from S to D) delivered by 
a deadline of K time slots over a fading channel assuming offline (prior) knowledge of the full 
CSI and the energy arrivals at S and R in each time slot. The resulting maximization problem is 
subject to a causality constraint on the harvested energy and the (maximum) storage constraint 
for the batteries at both S and R. 

The offline optimization problem for maximizing the throughput of the considered system for 
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K time slots can be formulated as follows: 

K/2 

max ^ m M£s,2fc-i,6l,2jfc} (11) 

fc=i 

l 2(1-1) 

s.t. ^(^S,2fc-1 + A s>2fc _i) < H s,k, V/ (12) 

k=l k=0 
I 21-1 

J2(P R ,2k + X R , 2k )<J2 H R,k, V/ (13) 

k=l k=0 
2m— 1 m 

H s,k - J2(Ps,2k-i + A 5 , 2 fc-i) < B S:max , Vm (14) 

k=0 k=l 
2m m 

5^ H R,k - y^(-PR,2fc + A Ri2 fc) < B R>max , Mm (15) 

fc=0 fc=l 

75,2fc-l-Ps,2fe-l = lR,2kPR,2k, V/c, (16) 

where T = {Ps,2fc-i, ^ii,2fe, As ;2fc _i, A^IA; G {1, 2, • • • , K/2}}. Also, V/, Vm, and VA; stand for 
/ G {1, 2, • • ■ ,K/2}, m G {1, 2, ■ • • ,K/2-l}, and fc G {1,2, ■ •• , K/2}, respectively. The slack 
variables \s,2k-i and \ Ry 2k ensure that constraints (fT4l)-(fT6l) can be met for all realizations of js,k, 
7^ ; fc, Hg k , and H R k . In particular, these slack variables represent the power (possibly) wasted 
in each transmission interval! Constraints (fl"2l) and <fl"3l) stem from the causality requirement on 
the energy harvested at S and R, respectively. Moreover, (fl4l) and (fl"5l) ensure that the harvested 
energy does not exceed the limited storage capacity of the batteries at S and R, respectively. 
Constraint (fTBl ) ensures that the amount of information transmitted from S to R is identical to 
that transmitted from R to D so as to avoid data loss at R. Constraint (fTBl ) is required since we 
assume individual power constraints for S and R. This is a reasonable assumption since S and 
i? have independent power supplies. 

'For example, in case of small Bs, ma x and Bji,max and large Hs,e and Hr,e, constraints d 14b and J 1 5b can become 
infeasible if \s,2k-i an d A^^fc are not introduced. As,2fe-i and \n,2k in the constraints avoid infeasibility of the problem and 
represent the amounts of energy that cannot be used (by 5* and R) in each time slot. 



September 12, 2012 



DRAFT 



Using (fT6l) in (TTTT) — <fT3T), the considered offline optimization problem can be rewritten as: 



max 



S.t. 



K/2 

X>,2*-i (17) 

k=l 

D \ 21-1 

75,2fc-l-T5,2fc-l 



2m m ( P \ 

,max ) 

Vm. (19) 



fc=0 fc=l 

Constraints (USD and (E 



where V = T \ {P R , 2 k\^k}. The problem in (fl7T>-(fl9l with (O and (O forms a convex 
optimization problem and the optimum solution can be obtained either in closed form by using the 
Karush-Kuhn-Tucker (KKT) conditional or by using any standard technique for solving convex 
optimization problems |[T9~ll , EOl . Let -P<j 2 fe-i denote the optimum solution of the considered 
optimization problem. The optimum P Ri2 k can be obtained as 

p * _ 7S,2A=-1-Pg, 2 fc-1 nm 

r R,2k — 7 • \ ZV} ) 

lR,2k 

B. Optimal Online Power Allocation by DP 

In practice, only causal information about channels and harvested energies is available for 
power allocation. Therefore, the offline power allocation scheme is not readily applicable as, at 
a given time slot, the future CSI and the upcoming harvested energy are not known in advance. 
We propose to employ a stochastic DP approach for optimum online power allocation ||3), [|2Ti . 

Let C 2 fc-l,2fc — (7S,2fc-l, lR,2ki {Hs,2(k-1) + Hs,2k-3), {H R ,2k-l + -Hfl,2(fc-1)), Bs,2k-1, B R pk) 

denote the state for time slots 2k — 1 and 2k. We note that Hs : k = for k < 0. Our aim 
is to maximize the total throughput over K time slots. We assume the initial state c x 2 = 

(75,i, 7r,2, H s ,o, (H Rj0 + H R>1 ), B s ,i, B R>2 ) is always known. We define a policy p = {(P s , 2 k-i 
(c2fc-i,2fc),/ J R, 2 fc(c2fc-i,2fc)),Vc 2 fc_i i 2fc, k = 1, 2, • • • , K/2} as feasible if the energy harvesting 

2 It can easily be shown that optimization problem J 1 7b — J 1 91> with J12t and d 1 4b has a water-filling solution. Due to space 
limitation, we omit the solution here. 
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constraints < Ps, 2fc _i(c 2fc _ li2fc ) < B S)2k -i and < P R , 2k (c 2k -i,2k) < B R>2k are satisfied for all 
k. Hence, the objective function to be maximized can be reformulated as 

K/2 

R(p) = j2E{mm{e S:2k ^,e R ,2k}\ci,2,p}, (21) 
k=i 

where we defined ^ |2 fe-i - lo S2 (1 + Js,2k-iPs,2k-i(c2k-i,2k)) and £ R2k = log 2 (l + j R , 2 kPR,2k 
( c 2fc-i,2fc))- The expectation is with respect to the SNRs of the channels and the harvested 
energies. In particular, for a given c x 2 , the maximum throughput can be obtained as 

R* = max R(p), (22) 

where V denotes the space of all feasible policies. 

The maximum throughput during time slots 2k— 1 and 2k is denoted by J 2k -i :2k (B S:2k -i, B R2k ). 
For a given ci )2 , the maximum throughput, Ji, 2 (Bs t i, B R)2 ), can be recursively obtained from 

Jk-i,k(Bs,k-i, Brjc), J K - 3jK - 2 (B s ,k-3, B RtK _ 2 ), J 3 a(B s ,3,B ra ) 0. For the last two 
time slots K — 1 and K , we have 

Jk-i,k(B s ,k-i, B RjK ) = max ]- min {£ s ,K-i, £r,k} (23) 

0<Ps,A'-l<Ss,A'-l Z 

0<Pr,k<B r , k 
1s,k-iPs,k-i=1r,kPr,k 

and for time slots 2k — 1 and 2k, we obtain 



J2k-i,2k(B St2 k~i, B R)2k ) = max — min {Cs , ,2fc-i, CK,2fc} 

0<Ps,2fc-l<-DS,2fc-l ^ 

0<PR,2k<B R/2k 

7S,2k-lPs,2k-l="/R,2kPR,2k 

+ <^2fc+l,2fc+2 ,2fc-l — Ps 5 R , 2fc -P fl , 2fc ),(24) 



where J 2k+ i t2k+2 (B' s ^ 2k+1 , B' R2k+2 ) — 

E "rs,2k+i,iR,2k + 2,Hs,2k-i,H R ,2 k { ^2fc+i,2fc^min{£^ 2fe+1 + #s i2fe _i, ^s.max}, 

min{5^, 2fc+2 + H R}2k , B R 

,max })}• (25) 

Here, B' S2k+1 = B s ,2k-i - Ps,2k-i, B' R2k+2 = B Rj2k - P Ri2k , ^ s ,2k+\ (lR,2k+2) represents the 
SNR of the S-R (R-D) link in the (2k + l)th ({2k + 2)th) slot given the SNR j s>2k -i (-y R>2k ) 
in the (2k — l)th (2/cth) slot, and Hs,2 k -i (H R ,2 k ) denotes the harvested energy at S (R) in the 
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(2fc-l)th (2fcth) slot given the harvested energy Hg^k-2 (Hn,2k-i) m me (2/c — 2)th ({2k — l)th) 
slot. It can be shown that the cost functions in (1231) and (1241) are concave in Ps,2k-i and Pr^h- 
Thus, (1231) and (1241) are convex optimization problems and can be solved very efficiently |fT9l . 
Further simplification of (l23l) yields 

Jk-i,k( B s,k-i, B r . k ) = - log 2 (1 + ls,K-ipK-x) , (26) 

where p K _ x = mm{B S)K _ 1 ,'y R)K BR K /'y S)K _ 1 }. Therefore, P^k-i = vsxax{B StK -i, ^K-f }' 
and P RK follows from (|20l ). Similarly, (1241) can be simplified as 

J2k-i,2k(B>s,2k~i, BR2k) = max -£s,2fc-i 

0<Ps,2fc-l<mm{S Si2 fc-i,7ii,2fc-BH,2fe/7S,2fe-i} ^ 
+ ^2fc+l,2fc+2(-Ss > 2/c-l — Ps,2k-1, B R2 k ~ 1 S ,2k-\P S .2k~\ ll R.2k) ■ (27) 

Using ([26]) and ([27]), Pg^k-i and ^fl,2fc> & e {1,2, ••• ,if/2}, can be obtained for different 
possible values of 75^, 7/j.fc, B S k , and and can be stored in a look-up table. This is done 
before transmission starts. When transmission starts, for a given realization of 7s,2fc-i, 7ii,2fc, 
Be 

2k-i, and B Ri 2k in time slots 2k — 1 and 2k, the values of Kj 2 fc-i and -^R.2fc corresponding 
to that realization are taken from the look-up table. 

C. Suboptimal Online Power Allocation 

In the proposed DP-based optimal online power allocation scheme, for a certain transmission 
time slot, we consider the average effect of all succeeding time slots, c.f. (l25l) . Due to the 
recursive nature of DP, the computational complexity of this approach increases alarmingly with 
increasing K. For this reason, in the following, we propose three different suboptimal online 
power allocation schemes, which perform close to the optimal DP approach but have reduced 
complexity. 

1) Suboptimal Simplified DP Power Allocation ("DP— 12" and "DP- -Ii" Schemes): In this 
scheme, we use the average effect of only 2 (or 4) following time slots to allocate the transmit 
power in each time slot. In particular, we assume for the current time slot that all energies have 
to be spent over the following 2 (or 4) time slots. Moreover, in the last two time slots, either 
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S or R uses up all of its stored energy. This scheme reduces the computational complexity at 
the expense of a performance degradation. We refer to the suboptimal DP schemes taking into 
account 4 and 2 time slots as "DP-I 2 " and "DP-Ii", respectively. 

2) Suboptimal Harvesting Rate (HR) Assisted Power Allocation ("HR Assisted" Scheme): In 
this scheme, we constrain the transmit powers Ps,2k-i an d Pn.2k by the average energy harvesting 
rates H$,e an d Hr,e, respectively. This scheme is referred to as "HR Assisted" power allocation. 
For a given time slot k G {1, 2, • • ■ , K/2 — 1}, the resulting optimization problem can be stated 
as: 

max £s,2fc-i (28) 

P S ,2k-l>0 

s.t. P S ,2k-l < £s,2*-i, Ps,2k-i < Hs,e, (29) 

-T5,2fe-1 S , "S,2k-l S (jU) 

75,2fc-l 7S,2fe-l 

The optimum solution for this optimization problem is given by 

D* r D rj 7R,2kBR,2k lR,2kHRE ,r, x 

" S 2fe-i = min{B Si2 fc-i, , }> (31) 

7S,2fe-l 75,2^-1 

and P^ 2 fc ^ s obtained from (l20l) . For the Jith time slot, we ensure that either S ox R use up all 
of their stored energy. 

3) Suboptimal Naive Power Allocation ("Naive" Scheme): In this suboptimal "naive" ap- 
proach, for each time slot, only the stored energies at hand determine the transmit power, i.e., 
this approach does not take into account the effect of the following time slots, and the transmitting 
node (S or R) uses up all of its available energy in each transmission interval. To be specific, 
for a particular time slot k G {1, 2, ■ ■ • ,K/2}, P^k-i = ram{B s ,2k-i, l3 f^f- }^ and P R,2k 
follows directly from d20l) . 

D. Complexity 

In the offline and the optimal online power allocation schemes, we solve convex optimization 
problems where the number of constraints is a function of K. The required computational 
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complexity to solve a convex optimization problem depends on the method used (e.g. bisection 
method, interior-point-method, etc.) and is polynomial in the size of the problem |fT9l . Therefore, 
the worst-case computational complexity of the offline power allocation scheme for conventional 
relaying is polynomial in the number of time slots K [fT9l . We observe from (l24l) and (|25T) that, 
in the optimal DP based online power allocation scheme, the optimal transmit powers of S 
and R at a given time slot depend on the average data rate of the next time slot. We consider 
different realizations of channel SNRs and harvested energies for the next time slot to obtain the 
average data rate. Furthermore, the data rates corresponding to each of the realizations of channel 
SNRs and harvested energies in the next time slot are also functions of the average date rates 
in the second next time slot. This dependency spans up to the last time slot and therefore, the 
complexity of the optimal online power allocation scheme increases exponentially with K. The 
less complex versions of DP, i.e., DP-I 2 and DP-Ii have linear complexities in K. Moreover, 
the naive and the HR assisted suboptimal online schemes have linear complexities in K. Note 
that we compare the complexity of the proposed offline and online power allocation schemes in 
terms of the required average execution time in the simulation results section. 

IV. Power Allocation for Buffer-Aided Adaptive Link Selection 

In this section, we propose offline and online power allocation schemes for EH systems with 
buffer-aided adaptive link selection. 

A. Offline Power Allocation 

Like for conventional relaying, our goal is to maximize the total number of transmitted bits 
(from S to D) delivered by a deadline of K time slots for the link adaptive transmission protocol. 
The offline (prior) information about the full CSI and the energy arrivals at S and R in each 
time slot are assumed to be known in advance. The offline optimization problem for the link 
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adaptive transmission protocol can be formulated as follows: 

K 

^41og 2 (l + 7^if,fc) (32) 



max 

T>0,{d k \\/k} 



k=l 

q q-1 



S.t. 



^((1 - d k )P s , k + \ s ,k) < H s,k, Vg (33) 

k=l k=0 
q q-1 

J2(d k P R ,k + Ah,*) < H R,k, Vg (34) 

k=l k=0 

v v 

^Hst-^dl-d^Pst + Xs^^Bs^, Wv (35) 

k=0 k=l 

v v 

53 HR,k — ^^(dkPR t k + ^R,k) < Br 

,max j (36) 

fc=i fc=i 
q q 

53(1 - 4) log 2 (1 + 75,fcPs,fe) > 53 4 lo S2 (! + !R,kP R ,k) , Vg (37) 

fe=l k=l 

4(1 - 4) = 0, Vfc, (38) 

where Vg, V/c, andVv stand for g e {1,2, • ■ • , K}, k e {1, 2, • • • and v G {1,2, • ■ ■ ,K-1}, 
respectively. Like for conventional relaying, constraints (I33l)-d36l) ensure the energy causality and 
limited energy conditions for buffer-aided link adaptive relaying. Constraint (|37l) ensures that R 
cannot transmit more bits than it has in its buffer. Moreover, (|38l) ensures that 4 can only be 
or 1, i.e., either S or R transmits in a given time slot, k E {1, 2, • • • , K}. We note that, in 
this optimization problem, although we are maximizing the throughput of the R-D link, using 
constraint (|37l) incorporates the effect of the throughput of the S—R link. As £s,k an d £,R, k are 
increasing functions of P Sfe and P R *, respectively, the optimization problem in (|32l)-d38l) can 



September 12, 2012 



DRAFT 



15 



be restated as follows: 



K 



max 

T">0, {<4|Vfc} 



k=l 

f C'-^-'l u^I^^ (40, 

k=l ^ ^ S ' k ' k=0 

1 / (nt„i. i\ s q-1 



£(««*^ + A \^ HRt , v , (41) 

fc=l ^ ^ R ' k ' k=0 

£ ^ - £ A 1 -^)^- 1 ) + A ^ < Bs m _ v, (42) 

fc=0 k=l ^ ^ S ' fe ' 

£ H Rjk - £ ( 4(2 ^' fc ~ 1} + Aj^ < £ fl , ma:r , Vt; (43) 
fc=i fc=i ^ ^ R ' k ' 

J> " d *)&,* ^ £ v <? (44) 
fc=l fc=l 

4(1 - 4) = 0, (45) 

where T" = {£s,k, A s>fe , A Rifc |£; G {1,2, ••• ,A"}}. The problem in C39}-<|45]) is a non- 
convex MINLP due to the binary variables 4 and the non-convex and non-linear constraints 
(l40l)-(l45T). In the following, we propose optimal methods to solve the buffer-aided link adaptive 
offline optimization problem. 

1) Exhaustive Search: For given 4, k G {1,2, ■• • , K}, the optimization problem in (1391)- 
(1431) is convex. Therefore, we can optimize £$,k and £,R,k for given 4 £ {0, 1} very efficiently. In 
this method, we optimize £s,k and £,n,k for all possible combinations of 4, k G {1, 2, • • • , if}, 
and select from all the solutions that combination of 4> k £ {1, 2, • • • , if}, which maximizes 
the cost function. This exhaustive search method provides the global optimal solution but with 
an exponential complexity. For instance, for K time slots we have 2 K ~ 2 feasible combinations of 
4 and hence to optimize £s,k and we need to solve 2 A ~ 2 optimization problems. Therefore, 
in practice, this approach cannot be adopted in general, especially for large K. However, the 
exhaustive search scheme can be effective for small K. 
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2) Spatial Branch-and-Bound: As mentioned before, our problem is a non-convex MINLP. 
One of the recent advances in (globally) solving MINLP problems is the sBB method ifToll . ifTTl . 
The sBB method sequentially solves subproblems of problem (T39l)-(l45l). These subproblems 
are obtained by partitioning the original solution space. For each subproblem, the sBB method 
relies on the generation of rigorous lower and upper bounds of the problem over any given 
variable sub-domain. The feasible lower bounds are chosen to be the local minimizers of the 
(sub)problems whereas the upper bounds are obtained from convex relaxations. Interestingly, 
MINLP problems can be solved by using the widely available open source solver Couenne IfTTl . 
[[T8l . Couenne provides the global optimal solution for both convex and non-convex MINLP 
problems. It implements linearization, bound reduction, and branching methods within a branch 
and bound framework. For global convergence and complexity issues of sBB and therefore 
Couenne, please refer to IfTTl . 

B. Online Power Allocation 

As offline power allocation requires non-causal knowledge of CSI and harvested energies, 
we also propose two sub-optimal online power allocation schemes, which require only causal 
knowledge of CSI and harvested energy. 

1 ) Online Power Allocation Based on Average Data and Average Harvesting Rates: Unlike 
for conventional relaying, for the link adaptive protocol, the use of DP for online power allocation 
is not possible because of the link selection. For this reason, for the online power allocation 
scheme, at first we formulate an optimization problem which is based on the average data rate, 
the average energy causality constraints at S and R, and the average buffering constraint for 
K — > oo. We assume that the batteries at S and R have unlimited capacities. Considering 
the above mentioned assumptions, the optimization problem for online power allocation can be 
formulated as: 
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K 



max — ^2 dk log 2 (l + Jr^Pr^) (46) 



fc=i 

if „ if-i 

i - 

s.t 



1 ^ 1 K ~ 1 



fc=l fc=0 



L * 1 



K ^ l ' ~ K 

k=l k=0 
1 K 1 K 

— - 4) log 2 (1 + 7s, feJ Ps,fe) = xJ2 dk lo S2 C 1 + 7iU~^V) (49) 
fc=i fc=i 

-^4(1 - 4) = o, \/k (50) 



The Lagrangian of (l46l)-(l50l) is given by 



1 K A ( K \ 

L = K^ dk log2(1 + ir ^ Pr ^ - 4 S (1 ~ 4)Ps,fc " ^' fc " 1 

fc=i \fe=i / 

< if \ 1 K 

5^ - #i?,fc-i J - -^y^ /3fc4(i - 4 

vfc=l / fc=l 



K 



A 



-JC [Yl 4 k&C 1 + 7«,*^B,fc) - (1 - 4) log 2 (l + 75, fc Ps, fc ) J , (51) 
where A s , A R , /x, and (3 k are Lagrange multipliers. Differentiating (I5TI ) with respect to P s ,k, 



Pr^, and 4 and equating each of the differentiated expressions to zero leads to the following 
optimum values of Ps,k, Pr,h, and 4: 

if 7sfc > — AND 4 = 0, 
P* k = { "S 75,*' ' 6 ' fe p (52) 

0, otherwise, 



- — , if 7/j fe > i/jj AND 4 = 1, 
"* 7 *' fe ' (53) 

0, otherwise, 



VS. 

p 



(54) 



1, if (C R > C s ) OR ( 7 i?, fc > u R AND 75ifc < 
0, if (C R < Cs) OR (j R , k < u R AND 75 , fc > ^ 

where = In (^) + ^ - 1, C s = p In + ^ -p, p = M /(l -/,), ^ 5 = A s ln(2)/(l - 

/i), and vr = Aflln(2)/(1 — p). We observe that the optimal P Sj k, Pn,k, and 4 depend on 
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the instantaneous channel SNRs and Lagrange multipliers. The Lagrange multipliers can be 
solved efficiently, as shown in the next part of this section, without requiring any non-causal 
knowledge. Therefore, the optimal Ps,k, Pn,k, and dk are readily applicable in the real-time 
(online) environment with low implementation complexity. 

Finding the Lagrange Multipliers: Combining (|52l) - (j54T) and (|47|) -(|49T) yields the following 
conditions for K — > oo: 



P 



\vr 



VS lS,k 



flsAlS,k)&lS,k 



+ 



p 1 



v,s 7s,fc 



As, fc (7s,fe)d7s,fc 



f7Rk(lR,k)dlR,k = H s ,e, (55) 



vr 7R,fe 



f lR , k {lR,k)^lR,k 



+ 



LX 2 



VR iRM 



/ 7 « fc (75,fe)d75,fe 



f^R, k (lR,k)djR tk 



As, fe (75,fc)d7 



s,k — Hr,e 



(56) 



VR 



log 2 



PlS,k 



f~isAisMis,k 



p 



fjRAlR,k)djR,k 



VR 



u s 



log 2 ( ) f ls J-fs,k)djs,k 



fjn k (lR,k)dj R ,k 



vr 



log 2 I ^ ) f^ k (lR,k)d^R, k 



I 'Ft 



/7s, fc (7s,fc)d75, fc 



+ 



where L\ 



vs 



IR^-VR 

pW I _ e PTfl,fc * ( 

\ TR,fc 



pr- and L 2 



/7s ifc (75,fc)d75,fc, (57) 
-. Here, W(-) is the 



Lambert W-function ||22ll . and f lsk {pfs,k) and f lRk ( r fR,k) denote the probability density functions 
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(pdfs) of the S-R and R-D channel SNRs, respectively. For Rayleigh fading, f~ tSk ( / ys,k) = 
{l/^ s )e~ ls ' khs and f lRtk {l R , k ) = {l/i R )e-"< R < khR . We need to solve (|55]>-(|571) to find the 
optimal vg, vr, and p. The solution can be obtained by using the built-in root-finding function 
in Mathematica. We note that the optimal us, vr, and p are computed offline before transmission 
starts. When transmission begins, Pg h , Pr^ an ^ d* k are calculated based on offline parameters 
p, u s , and v R and online variables 7 S fc and 7^. 

The solution of problem (l46l)-(l50l) provides an upper bound for the practical case where the 
storage capacity of the batteries is limited. Moreover, the problem may yield Pr^ ^ even if the 
buffer is empty at R. To avoid this undesirable behavior, we propose a practical but suboptimal 
online algorithm which is summarized in Algorithm 1. At first, we calculate B S k and B Rk using 
© and ©, respectively. We then calculate P* Sk , P* Rk , and d* k from (152l)-(l54l) and (l47l)-(l49l). To 
ensure that Pg k and P Rk do not exceed the storage limits, we perform steps 6 to 8 and 11 to 
13, respectively. Steps 9 and 17 keep track of the arrival of data bits into and the departure of 
data bits out of the buffer, respectively. Steps 14 to 16 are adopted to ensure that R transmits 
only if there is data in the buffer. 

2) Naive Online Power Allocation: In the suboptimal naive power allocation scheme for 
link adaptive relaying, at each time slot, k, S and R consider the amount of energy stored in 
their batteries as their transmit powers. Based on the transmit powers, S and R compute their 
capacities. Note that the buffer status should be taken into account in the computation of the 
capacity of R. The S-R (R-D) link is selected if the capacity of S is greater (smaller) than 
that of R. 

C. Complexity 

The worst-case computational complexity of the sBB algorithm used to solve the offline power 
allocation scheme for link adaptive relaying is exponential in K ifTTIl . whereas the computational 
complexity of the exhaustive search algorithm is always exponential in K. Moreover, the worst 
case complexity of the sBB algorithm is not likely to occur for all possible realizations of 
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channel SNRs and harvested energies which is evident from the execution time results shown 
in the simulation results section. Determining the exact and/or average complexity of the sBB 
algorithm is more involved and beyond the scope of this paper. The proposed online schemes 
for link adaptive relaying have linear complexities in K. 

Algorithm 1 Online Power Allocation Algorithm For Buffer-Aided Link Adaptive Relaying 
1: Initialize the buffer status, Q = bits; 
2: for k = 1 to K do 

3: Calculate B s ,k and B R>k using © and ©, respectively. 
4: Calculate P| fc , P* Rk , and d* k using (|52l)-(|54l) and <g7M35J. 
5: if d* k = then 
6: if P| jfc > B s , k then 

7: ^fc = B s,k, 

8: end if 

9: Qfc = Qfc-i + log 2 (l + 75,*^5 lfc ); 

10: else 

11: if > B R>k then 

12: P^ fc = -Br^; 

13: end if 

14: if log 2 (l + iR, k P* Rtk ) > Qk then 

15: P * -£h=l. 

16: end if 

17: Q k = Q fc _x - log 2 (l + lR,kP* R , k ); 
18: end if 
19: end for 

K 

20: Obtain throughput = £ 41og 2 (l + l R , k P Rk ). 

k=l 
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V. Simulation Results 

In this section, we evaluate the performance of the proposed offline and online power allocation 
schemes for the conventional and link adaptive relaying protocols. We assume that the (overall) 
average harvesting rate is Hs,e = Hr ; e = He, and H$,k and Hr^ independently take values 
from the set {0, He, 2He}, where all elements of the set are equiprobable. For Figs. [2]-[6l [HI and 
HI we assume H E = 0.5. We adopt B Stmax = B R>max = B max , where B max = 4 for Figs. I2HS 
and B max = 10 for Figs. |4]-[6l M We assume i.i.d. Rayleigh fading channels with 75 = 7r = 7 
for Figs.[2]-[5]and[9]and i.n.d. Rayleigh fading channels for Figs.[6]-[8] We simulate 10 4 randomly 
generated realizations of the S-R and the R-D channels and the harvested energies at S and 
R to obtain the average throughput. 



A. Performance of Different Power Allocation Schemes for Conventional Relaying 

In this subsection, we show the performance of the proposed power allocation schemes for 
conventional relaying. In particular, the impact of the average channel SNR and the number of 
time slots on the total number of transmitted bits is studied. 
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Fig. 2. Conventional relaying: Total number of transmitted bits vs. average channel SNR 7 for K — 10. 

Fig. [2] shows the total number of transmitted bits for the power allocation schemes proposed 
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for conventional relaying vs. the average channel SNR, 7, for K = 10. We observe that for all 
considered schemes, the total throughput increases as 7 increases. We also notice that the offline 
scheme performs better than the online power allocation schemes for all 7. This is due to the 
fact that in the optimal offline scheme we assume that both causal and non-causal information 
regarding the CSI and the harvested energy are available whereas the online schemes are based 
only on causal information regarding the CSI and the harvested energy. Moreover, as expected, 
the optimal online scheme outperforms all considered suboptimal online schemes and performs 
close to the optimal offline scheme. The suboptimal online schemes DP-I 2 and DP-Ii perform 
close to each other for all 7- We note that both DP-I 2 and DP-Ix outperform the HR assisted 
and the naive schemes. 



400 




Number of time slots 

Fig. 3. Conventional relaying: Total number of transmitted bits vs. number of time slots K for 7 = 25 dB. 

In Fig. |3l we show the total number of transmitted bits for the power allocation schemes 
proposed for conventional relaying vs. the number of time slots K for 7 = 25 dB. We observe 
that the optimal offline method achieves the best performance. Among the different suboptimal 
online schemes, DP-I 2 performs best. The HR assisted scheme provides a similar performance 
as DP-I 2 for large K. This is mainly due to the fact that the HR assisted scheme is based on the 
average harvesting rate which is a more effective approach for large K. Moreover, we observe 
that the difference between the performances of DP-I 2 and DP-I1 increases with increasing K. 
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This shows that the consideration of the two next time slots instead of only the next time slot 
for calculation of the optimal transmit powers becomes more important for larger K. 

B. Performance of Different Power Allocation Schemes for Link Adaptive Relaying 

In this subsection, we show the impact of the average channel SNR and the number of time 
slots on the total number of transmitted bits for the different power allocation schemes proposed 
for link adaptive relaying. 
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Fig. 4. Link adaptive relaying: Total number of transmitted bits vs. average channel SNR 7 for K = 8 and K — 50. 

Fig. H] shows the total number of transmitted bits for link adaptive relaying vs. the average 
channel SNR 7 for K = 8 and K = 50. Here, we consider the exhaustive search offline algorithm 
for K = 8, and the online, the naive online, and the sBB offline power allocation schemes for 
both values of K. Recall that the optimal offline exhaustive search scheme is only effective for 
small K as the complexity increases exponentially with K. For K = 8, we observe that the 
exhaustive search and the sBB schemes have exactly the same performance for all the considered 
7. This observation confirms that the sBB scheme finds the global optimum solution of non- 
convex MINLP problems |fT6l , IfTTl . The performance gap between the offline and the online 
schemes is small at low 7 and large at high 7 for K — 8. Furthermore, the performance gap 
increases with 7 for K = 50. For K = 8, the naive online power allocation scheme has a small 
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performance advantage over the online algorithm for high 7, whereas for K = 50, the online 
algorithm shows better performance for all considered 7. 

C. Comparison Between Conventional and Link Adaptive Relaying 

In this subsection, we compare the power allocation schemes proposed for conventional and 
link adaptive relaying. For offline power allocation, we compare the optimal schemes and for 
online power allocation, we compare the suboptimal schemes for conventional relaying with the 
online scheme for link adaptive relaying. The optimal DP approach (for conventional relaying) 
is not included in the comparison because of its high complexity. 




Number of time slots 

Fig. 5. Comparison of conventional and link adaptive relaying: Total number of transmitted bits vs. number of time slots K 
for 7s = jr = 7 = 30 dB. 

1) Total number of transmitted bits vs. K: Fig. [5] shows the total number of transmitted 
bits vs. the number of time slots, K for the offline and online power allocation schemes for 
conventional and link adaptive relaying. We assume symmetric S-R and R-D channels, i.e., 
7s — Jr — 7- We observe that link adaptive relaying significantly outperforms conventional 
relaying for offline power allocation. The performance gap increases with increasing K. This 
is mainly due to the fact that, for large K, we have more flexibility in selecting c4, i.e., in 
selecting the S-R or R-D link for transmission to increase the system throughput. In particular, 
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for the offline case, the link adaptive relaying scheme can transmit 16 and 55 additional bits 
compared to conventional relaying for K = 20 and K = 100, respectively. For small K, the 
online scheme for link adaptive relaying does not show a better performance than the online 
schemes for conventional relaying. However, for relatively large K, e.g. K = 140, the online 
scheme for link adaptive relaying outperforms the DP-I 2 and HR assisted online schemes for 
conventional relaying by 26 and 36 bits, respectively. 




Number of time slots 

Fig. 6. Comparison of online algorithms for conventional and link adaptive relaying: Total number of transmitted bits vs. 
number of time slots K for 7s = 20 dB (Scenario 1) and 7s = 10 dB (Scenario 2) with 7_r = 30 dB. 

In Fig. [6l we turn our attention to asymmetric links where we consider two scenarios for the 
average channel SNRs. Scenarios 1 and 2 are valid for 7 s = 20 dB and 75 = 10 dB, respectively, 
where 7?? = 30 dB in both cases. We compare the performance of the online scheme for link 
adaptive relaying with those of the DP-I 2 and HR assisted schemes for conventional relaying. In 
contrast to Fig. [5j we observe that the online scheme for link adaptive relaying outperforms the 
DP-I 2 (HR assisted) schemes even for small numbers of time slots, e.g. K = 25. Moreover, Fig. [6] 
clearly shows that the performance gains of the online scheme for link adaptive relaying over 
the DP-I 2 (HR assisted) scheme for conventional relaying are significantly larger for asymmetric 
links compared to symmetric links. The larger gains are caused by the flexibility introduced by 
the buffer at the relay. In the link adaptive scheme, the stronger link can be used less frequently 
since relatively large amounts of information can be transferred every time the link is used. 
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Hence, the weaker link can be used more frequently to compensate for its poor link quality. In 
contrast, in conventional relaying, both links are used for the same amount of time regardless 
of their respective qualities. 




Average harvesting rate 

Fig. 7. Comparison of conventional and link adaptive relaying: Total number of transmitted bits vs. He for K = 60, 7s = 10 
dB, and j R = 30 dB. 

2) Total number of transmitted bits vs. He-' Fig. [7] depicts the total number of transmitted 
bits for conventional and link adaptive relaying vs. the average harvesting rate, He, for 7s = 10 
dB, ttj = 30 dB, and K = 60. We observe that the throughput increases with increasing H E 
for all considered power allocation schemes. We note that the slope of the throughput curves is 
large for small He and decreases with increasing He- This behavior is partially (apart from the 
behavior of the log(-) function) due to the fact that the performance of all schemes is limited by 
the finite storage capability of the batteries. For large He, additional energy cannot be stored in 
the batteries and therefore the extra amount is wasted. We observe that the optimal offline and 
online schemes for link adaptive relaying outperform the corresponding schemes for conventional 
relaying for all H E - 

3) Total number of transmitted bits vs. B max : In Fig. [H we show the total number of 
transmitted bits for conventional and link adaptive relaying vs. B max for 75 = 10 dB, 7/? = 30 
dB, and K = 100. We observe that for all considered power allocation schemes, the throughput 
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Fig. 8. Comparison of conventional and link adaptive relaying: Total number of transmitted bits vs. B ma x for K = 100, 
7s = 10 dB, and j R = 30 dB. 

increases with increasing B max and starting at a certain value of B max , the throughput remains 
unchanged. This can be explained by the fact that with He = 0.5 small values of B max limit 
the performance since the extra amount of harvested energies cannot be stored in the batteries. 
However, the constant throughput of all the schemes indicates that, for the given parameters, 
increasing the storage capacities of the batteries beyond a certain value does not improve the 
performance of the system. Therefore, Fig. [8] provides an indication for the required storage 
capacities of the batteries at S and R for different power allocation schemes to achieve a desired 
performance. 

4) Execution time vs. K: In Fig. [91 we show the average execution time (in seconds) vs. the 
number of time slots, K, for all offline and online power allocation schemes for conventional and 
link adaptive relaying. We ran all the algorithms on the same simulation platform. In particular, all 
simulations were performed by MATLAB with the Intel(R) Core(TM) i7-2670QM (@2.20GHz 
2.20GHz) processor. Therefore, it is justified to compare the complexity of the algorithms based 
on their execution times. We observe that for both the offline and online schemes for both 
conventional and link adaptive relaying the execution time increases with K. Moreover, the 
required execution time for the sBB algorithm is higher than that for the offline scheme for the 
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Fig. 9. Comparison of execution times of conventional and link adaptive relaying: Execution time (in seconds) vs. number of 
time slots K for 7 = 30 dB. 

conventional relaying for all K. The complexity of forming the look-up tables is not included 
in this analysis. Fig. [9] provides insightful information about the complexity-performance trade- 
off of the different proposed algorithms. For example, in case of conventional relaying, the 
HR assisted scheme is less complex than the DP-I 2 scheme with only a small performance 
degradation for sufficiently high average channel SNR and large K, see Fig. [3] Therefore, it is 
preferable to apply the HR assisted scheme compared to the DP-I 2 scheme. Moreover, although 
the worst case complexity of the sBB algorithm is exponential in K, from Fig. [9] we observe 
that its average complexity is comparable to that of the offline power allocation scheme for 
conventional relaying. Thus for link adaptive relaying, we can conclude that the sBB algorithm 
is preferable over the exhaustive search method (which is not shown in Fig. [9] due to its very 
high execution time). 

VI. Conclusions 

In this paper, we have considered the problem of transmit power allocation for single relay 
networks, where the source and the relay harvest the energy needed for transmission from 
the surrounding environment. Two different transmission strategies, namely conventional re- 
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laying and link adaptive relaying, have been considered. We have proposed several optimal 
and suboptimal offline and online power allocation schemes maximizing the system throughput 
of the considered EH systems over a finite number of time slots. Simulation results showed 
that the proposed suboptimal online schemes present a good complexity-performance trade-off. 
Moreover, we showed that, for both offline and online optimization, adopting the link adaptive 
protocol significantly improves the throughput compared to conventional relaying, especially for 
asymmetric link qualities. 
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